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We have designed a novel non-antibody scaffold pro- 
tein, termed Adhiron, based on a phytocystatin con- 
sensus sequence. The Adhiron scaffold shows high 
thermal stability (r„ ca. 101°C), and is expressed well in 
Escherichia coli. We have determined the X-ray crystal 
structure of the Adhiron scaffold to 1.75 A resolution re- 
vealing a compact cystatin-like fold. We have constructed 
a phage-display library in this scaffold by insertion of 
two variable peptide regions. The library is of high 
quality and complexity comprising 1.3 x 10^** clones. To 
demonstrate library efficacy, we screened against the 
yeast Small Ubiquitin-like Modifier (SUMO). In selected 
clones, variable region 1 often contained sequences hom- 
ologous to the known SUMO interactive motif (V/I-X-V/ 
I-V/I). Four Adhirons were further characterised and 
displayed low nanomolar affinities and high specificity 
for yeast SUMO with essentially no cross-reactivity to 
human SUMO protein isoforms. We have identified bin- 
ders against >100 target molecules to date including as 
examples, a fibroblast growth factor (FGFl), platelet 
endothelial cell adhesion molecule (PECAM-1; CD31), 
the SH2 domain Grb2 and a 12-aa peptide. Adhirons are 
highly stable and well expressed allowing highly specific 
binding reagents to be selected for use in molecular rec- 
ognition applications. 

Keywords: consensus protein/high specificity binding/ 
non-antibody-binding protein/protein-protein interaction/ 
SUMO 



Introduction 

Antibodies are the most commonly used binding proteins with 
>240 candidates in clinical development (Reichert, 2010) and 
remain extremely important in scientific research, diagnostics 
and therapy. Nevertheless, antibodies have a number of limita- 
tions; they are large multimeric proteins that require disulphide 
bonds and glycosylation for stability. They are usually difficult 
to express in bacterial systems and are often highly sensitive to 
elevated temperatures. Antibody production is generally ex- 
pensive, relies upon the use of animals and mammalian cell 
culture, while for clinical applications they must be engineered 
as humanised versions. Protein engineering has been exploited 
to develop alternative non-antibody binding proteins which 
mimic the molecular recognition properties of antibodies but 
with some improved properties. 

Some 50 novel non-antibody protein scaffolds have been 
designed to constrain and present variable peptide sequences 
for protein recognition (Skerra, 2007; Gebauer and Skerra, 

2009) . These include designed ankyrin repeat proteins 
(DARPins) (Binz et al, 2003), Repebodies (Lee et ai, 2012), 
Anticalins (Schlehuber and Skerra, 2005), Fibronectins (Koide 
et ai, 1998), Affibodies (Nord et ai, 1995) and engineered 
Kunitz domains (Nixon and Wood, 2006). Artificial binding 
proteins are in general small (<200 aa), monomeric, stable 
and easy to express in Escherichia coli. Most do not contain 
cysteines enabling the introduction of a cysteine for site- 
specific coupling of biotin, fluorescent labels, or polyethylene 
glycol to enhance their utility or stability. These characteristics 
make artificial binding proteins powerful tools capable of re- 
placing antibodies in a range of applications including re- 
search (Wojcik et ai, 2010), diagnostics (Theurillat et ai, 

2010) , in vivo drug discovery studies (Grebien et al., 2011; 
Parizek et al, 2012) and a novel class of therapeutics (recently 
reviewed (Gebauer and Skerra, 2009; Wurch et al, 2012)) in- 
cluding multivalent and/or multi-specific protein therapeutics 
(Carter, 201 1). A potential disadvantage of small artificial pro- 
teins for therapeutic purposes is a short circulatory half-life; 
however, this can be overcome by fusing them to larger pro- 
teins such as albumin or by using PEGylation or PASylation 
(XL-Protein GmbH) or XTEN (Schellenberger et al, 2009) 
approaches. 

An important consideration when designing artificial protein 
scaffolds is their thermostability. In general, there is a correl- 
ation between thermal stability and other aspects of protein 
stability, so a highly stable scaffold enables long-term storage 
at ambient temperature and also broadens utility in a range of 
processes including heat purification and options for the stor- 
age of reagents and administration of therapeutics. Insertion of 
peptide loops into a scaffold often leads to a decrease in protein 
stability so it is desirable to select a very stable scaffold for 
combinatorial library generation. One approach to generate 
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Stable proteins is the consensus design concept (Steipe et al. , 
1994). This is based on the premise that sequence conservation 
arises from evolutionary pressure to maintain stability elements 
(Mosavi et al, 2002). While any natural protein is expected 
only to have evolved the level of stability required for it to effi- 
ciently perform its function, a consensus protein reinforces 
structural stability. This approach has been successfully used to 
improve the thermostability of enzymes (Lehmann et al., 2000; 
Komor et al., 2012), antibodies (Knappik et al., 2000) as well as 
artificial binding proteins (Main et al., 2003; Forrer et al., 2004; 
Jacobs et al., 2012). 

In the present study, we describe a new artificial binding 
protein scaffold that we have named Adhiron. This scaffold is 
based on a consensus sequence of plant-derived phytocysta- 
tins, which are small (ca. 100 aa) protein inhibitors of cysteine 
proteases (Kondo et al., 1991). This consensus protein dis- 
played very good protease inhibitor activity and also the 
requirements (small, monomeric, high solubility and high sta- 
bility and the lack of disulfide bonds and glycosylation sites) 
for a good scaffold for peptide presentation. We chose to 
replace inhibitory sequences within the Gin Val Val Ala Gly 
and Pro Trp Glu loops of the consensus phytocystatin with 
nine randomized amino acid positions in each loop. 

To evaluate the functionality of our scaffold and the phage- 
display library, we chose yeast SUMO protein as a model target 
for screening. Four different yeast SUMO-binding Adhirons 
were selected and characterized for binding affinity, stability 
and specificity. In addition, we have selected specific binders 
against some 100 distinct molecules between 2011 and 2013 
including, as examples, fibroblast growth factor (FGFl), cell 
adhesion molecule CD31, SH2 domains and a 12-aa peptide. 



Materials and methods 

Construction of the Adhiron library 

A consensus sequence derived from alignment of 57 phyo- 
cystatin sequences was developed to enhance phytocystatin 
properties. To design the consensus phytocystatin gene, a 
tBLASTN search of the Genbank database was performed 
using OSA-I {Oryza sativa; U54702), ZMA2 {Zea mays; 
D38130) and HANI (Helianthus annuus; Q10993) protein 
sequences as search probes. The accession codes and sources 
of sequences used to derive the consensus sequence are shown 
in Supplementary Table SI. These coding sequences were 
translated and aligned using the program MULTALIN (Corpet, 
1988). This consensus protein was then further modified and 
truncated for use as a non-antibody protein scaffold, termed 
Adhiron, and an E. coli codon optimised gene was synthesised 
(GenScript) (Fig. lA). The Adhiron scaffold coding region 
was cloned between Nhel and NotI restriction sites in a phage- 
mid vector pBSTGl (GenBank accession number KJ474865) 
developed from pHENl (Hoogenboom et al., 1991). Cloning 
into pBSTGl creates a fusion coding sequence encoding a 
DsbA secretion signal peptide, Adhiron, TAG codon and 
C-terminal half of gene III of bacteriophage M13 (Fig. IB). 
The resulting phagemid is referred to as pBSTGl-Adh. The 
in-frame amber (TAG) stop codon allows translational read- 
through to create an Adhiron-truncated-pIII fusion protein in a 
suppressor E. coli strains such as ER2738 (F'proA^B^ laclq 
A(lacZ)M15 zz/::TnlO(TetR)//7iMA2 glnV A(lac-proAB) thi-1 
\(hsdS-mcrB)5) but not in a non-suppressor strains such as 



JM83 (ara A(lac-proAB) rpsL (Sf^)[ip80 dlacA(lacZ)M15] 
thi). 

The Adhiron library was constructed by splice overlap ex- 
tension (SOE) of two PGR products (Horton et al, 1990) and 
all primers were synthesised by Ella Biotech. The first PGR 
product extended from the DsbA coding sequence to the first 
variable loop and was generated by the primers: Forward 
primer 5'-TCTGGCGTTTTCTGCGTC-3' and Reverse primer 
5'-CTGTTCTTTCGCTTTAACAAC-3'. The second PGR 
product introduced two nine amino acid variable regions (VRs) 
into the scaffold using the following primers. The PstI site 
used for cloning is underscored: Forward_VR 5'-GTTGTTA 
AAGCGAAAGAACAGNNNNNNNNNNNNNNNNNNNNN 
NNNNNNACCATGTAGTAGGTGAGGCTG-3' and Reverse. 
VR 5'-CTGCGGAAGTG GTGCAG TTCTTTGAAGTTNNNN 
NNNNNNNNNNNNNNNNNNNNNNNCTTAACGCAAAG 
TTTCGGTTGG-3'. 

The degenerate positions (NNN) were introduced as trimers 
representing a single codon for each of the 19 amino acids ex- 
cluding cysteine and there were no termination codons. PGR 
used Phusion High Fidelity Polymerase (NEB) at 98°C for 
5 min then 20 cycles of 98°G, 10 s; 56°G, 15 s; 72°G, 15 s fol- 
lowed by 72°G for 5 min. PGR products were purified by gel 
extraction (Qiagen), used for 10 cycles of SOEing as above, 
then digested with Nhel and PstI, gel extracted and cloned into 
the similarly digested pBSTGl-Ad phagemid. 

Ligated products were electroporated into E. coli ER2738 
electrocompetent cells (Lucigen). In total, 20 ml of ER2738 
cells was electroporated with 126 ng of library DNA per 50 |xl 
of ER2738 cells. After a 1-h recovery in 2TY medium, cells 
were grown at 37°G and 225 rpm to an ODgoo of 0-6 in 5 1 of 
2TY medium. M13K07 helper phage (lO") were added with 
shaking at 37°G and 100 rpm for 1 h followed by overnight at 
25°G with kanamycin (50 |JLg/ml). Phage were precipitated 
with 6% polyethylene glycol 8000 and 0.5 M NaGl, and were 
suspended in 50% glycerol for storage. 

Target preparation and phage display 

Yeast SUMO (ySUMO) protein was expressed in BL21 (DE3) 
cells using isopropyl (3-D-l-thiogalactopyranoside (IPTG) in- 
duction and purified by Ni-NTA resin (Qiagen) affinity chro- 
matography according to the manufacturer's instructions. 
Purity was confirmed by sodium dodecyl sulphate polyacryl- 
amide gel electrophoresis (SDS-PAGE). The following proto- 
cols are described for yeast SUMO with an identical protocol 
used for the screening of other targets. Yeast SUMO was bioti- 
nylated using EZ-link NHS-SS-biotin (Pierce), according to 
the manufacturer's instructions. Biotinylation was confirmed 
using streptavidin conjugated to horseradish peroxidase 
(HRP). Biotinylated ySUMO was bound to streptavidin-coated 
wells (Pierce) for 1 h, then lO'^ cfu pre-panned phage were 
added for 2.5 h with shaking. Panning wells were washed 10 
times and eluted with 50 mM glycine-HGl (pH 2.2) for 
10 min, neutraUsed with IM Tris-HGL (pH 9.1), further 
eluted with triethylamine 100 mM for 6 min, and neutralised 
with 1 M Tris-HGl (pH 7). Eluted phage were used to infect 
ER2738 cells for 1 h at 37°G and 90 rpm then plated onto LB 
agar plates with 100 |JLg/ml carbenicillin and grown overnight. 
Golonies were scraped into 5 ml of 2TY medium, inoculated 
in 25 ml of 2TY medium with carbenicillin (100 |jLg/ml) and 
infected with ca. 1 x lO' M13K07 helper phage. After 1 h at 
90 rpm, kanamycin was added to 25 |xg/ml for overnight at 
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helix 



GVRAVPGNENSLEIEELA 
GCTACCGGTGTTCGTGCAGTTCCGGGTAACGAAAACTCCCTGGAAATCGAAGAACTGGCT 



P-strand 1 



RFAVDEHNKKENALLEFVRV 
CGTTTCGCTGTTGACGAACACAACAAAAAAGAAAACGCTCTGCTCGAATTCGTTCGTGTT 

LOOPl 

xxxxxxxxx. 







V K A K E Q 


V V A G 



P-strand2 



TMYYLTLEAK 
GTTAAAGCTAAAGAACAGGTTGTTGCTGGTACCATGTACTACCTGACCCTGGAAGCTAAA 

LOOP2 

xxxxxxxxx 



P-strand 3 



K K K 



E A K V W V K 



W E 



N 



GACGGTGGTAAAAAGAAACT6TACGAAGCTAAAGTTTG6GTTAAACCGTGGGAAAACTTC 



p-strand 4 



KELQEFKPVGDA 
AAAGAACTGCAGGAGTTCAAACCGGTTGGTGACGCT 



B 




DsbA Signal Sequence 
Nhd 

Adhiron 

Psil 

NotI 
HisTag 
Amber Stop 
Truncated gplll 



AmpR 



Fig. I . Adhiron coding region and phagemid vector. (A) Codon optimized coding sequence and amino acid sequence of the Adhiron92 scaffold with secondary 
structure elements indicated. The residues that are replaced by the nine randomized amino acids (X) to form LOOPl and LOOPl in the Adhiron library are boxed. 
In place of the N-terminal residues AlaThrGly, the original consensus sequence contained the N-terminal sequence AlaAlaLeuLeuGlyGly. (B) pBSTGl phagemid 
vector containing the coding region for Adhiron92 indicating relevant features of the construct. 



25 °C and 170 rpm. Phage were precipitated with 4% poly- 
ethylene glycol 8000, 0.3 M NaCl and resuspended in 1 ml of 
10 mM Tris, pH 8.0, 1 mM EDTA (TE buffer). A 2 |jl1 aliquot 
of phage suspension was used for the second round of selec- 
tion using streptavidin magnetic beads (Invitrogen). Yeast 
SUMO labelled beads were washed and incubated with pre- 
panned phage for 1 h then washed five times using a 



KingFisher robotic platform (ThermoFisher) and eluted and 
amplified as above. The final pan used neutravidin high 
binding capacity plates (Pierce), as previously described for 
panning round one, with phage eluted using 100 |xl of 
100 mM dithiothreitol. Phage were recovered from wells con- 
taining target protein and control wells to determine the level 
of amplification in target wells. 
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Phage ELISA 

Individual ER2738 colonies were grown in 100 |jl1 of 2TY 
with 100 |JLg/ml of carbenicillin in a 96-deep well plate at 
37°C (900 rpm) for 6 h. A 25 |jl1 aliquot of the culture was 
added to 200 |xl of 2TY containing carbenicillin and grown at 
37°C (900 rpm) for 1 h. Helper phage (10 |jl1 of 10"/ml) were 
added, followed by kanamycin to 25 |JLg/ml overnight and 
incubated at 25°C (450 rpm). Streptavidin-coated plates 
(Pierce) were blocked with 2 x casein blocking buffer 
(Sigma) overnight at 37°C. The plates were incubated with 
biotinylated yeast SUMO or biotinylated linker for 1 h, and 
45 |jl1 of growth medium containing the phage was added and 
incubated for 1 h. Following washing, phage were detected by 
a 1 : 1000 dilution of HRP-conjugated anti-phage antibody 
(Seramun) for 1 h, visualised with 3,3',5,5'-tetramethylbenzi 
dine (TMB) (Seramun) and measured at 610 nm. 

Adhiron protein production 

The DNA coding sequences of Adhirons were amplified by 
PGR, restriction digested with Nhel and PstI and cloned into 
pETlla containing the Adhiron scaffold similarly digested. 
Plasmid DNA was purified (Qiagen) from transformant col- 
onies and sequenced to confirm the correct insert. Following 
transformation into BL21 (DE3) cells (F~ ompT gal dcm Ion 
hsdSslrB nis) X(DE3 [lad lacUV5-T7 gene 1 indl sam7 
ninS]) 400 ml cultures in LB medium were induced with 
0. 1 mM IPTG. Cells were grown for 6 h, harvested and lysed 
with Bugbuster (Novagen). The clear supernatant was mixed 
with 500 |xl of Ni-NTA resin slurry (Qiagen) for 1 h, washed 
three times in (50 mM PBS, 500 mM NaCl, 20 mM imidazole, 
pH 7.4) and eluted in 3 x 1 ml of elution buffer (50 mM PBS, 
500 mM NaCl, 300 mM imidazole, pH 7.4). One hundred 
micrograms of the SUMO-binding Adhirons were biotinylated 
using NHS SS-biotin (Pierce) according to the manufacturer's 
instructions for use in enzyme-linked immunosorbent assay 
(ELISAs) and western blotting. 

EL/SA analysis with purified Adhirons 

Unless otherwise indicated 5 ng of target protein in PBS was 
absorbed onto Immuno 96 Microwell™ Nunc MaxiSorp™ 
plate wells overnight at 4°C. Two hundred microlitre of 3 x 
blocking buffer was added at 37°C for 4 h with no shaking. 
Biotinylated yeast SUMO-binding Adhirons (Ad-ySUMO) at 
0.1 |xg/ml in Phosphate-buffered saline with Tween 20 
(PBST) containing 2x blocking buffer were incubated in 
wells for 1 h with shaking. Wells were washed three times in 
PBST, and streptavidin conjugated to HRP (Invitrogen) 
diluted 1 : 1000 in 50 |jl1 PBST was added for 1 h. After 
washing Adhiron binding was visualised with 50 |xl TMB 
(Seramun) and the absorbance measured at 610 nm. 

Western blot analysis 

Target protein alone or mixed with HEK293 cell lysate 
(20 |JLg) was mixed with loading buffer (60 mM Tris-Cl pH 
6.8, 2% SDS, 10% glycerol, 5% (B-mercaptoethanol, 0.01% 
bromophenol blue), boiled for 3 min, centrifuged for 1 min at 
15 OOOg and then resolved in a 15% SDS-polyacrylamide gel. 
Proteins were transferred to PVDF membranes for 45 min at 
4 W (Amersham Biosciences) and incubated for 1 h in block- 
ing buffer (5% BSA in PBS 0.1% Tween) followed by incuba- 
tion for 1 h with Ad-ySUMO (100 |xg/ml diluted 1 : 1000 



PBST). Bound Ad-ySUMOs were detected using streptavidin- 
conjugated HRP and chemiluminescence (ECL Plus kit, 
Amersham). 

Mass spectrometry analysis of Adhirons 

Adhiron samples in PBS were buffer exchanged into 50 mM 
ammonium acetate using Zeba spin columns (7K MWCO; 
Thermo Scientific) and 20 ml samples of 20 mM protein were 
analysed on a Synapt HDMS (Waters UK Ltd) electrospray 
mass spectrometer. 

Protein-protein interaction affinity measurement 

The BLitz™ (ForteBio) dip and read streptavidin biosensors 
were used to estimate the affinity of ySUMO binding to immo- 
bilised biotinylated Ad-ySUMO binders, according to the 
manufacturer's instructions. At least four readings at different 
ySUMO concentrations (0.25-1 mM) were used for each 
Ad-ySUMO and a global fit was used to calculate binding af- 
finities. The values obtained were comparable with affinities 
measured using a Biacore SPR3000 instrument. 

DSC 

Adhiron thermal stability was measured by differential scan- 
ning calorimetry (DSC) using a VP-DSC MicroCalorimeter 
(GE Healthcare). Each sample was dialysed extensively 
against PBS, pH 7.4 and diluted to a concentration of 1 mg/ml 
before heating from 10 to 100°C for Adhiron library clones or 
to 120°C for the Adhiron scaffold 92 and 81 amino acid pro- 
teins at an upscan rate of 90°C per hour. A buffer only scan 
was measured to calculate a baseline for integration. Data 
were fit to a one-state unfolding model and reversibility of 
thermal denaturation was determined by repeating the scan for 
the Adhiron scaffold sample without removing it from the cell. 

ITC 

All experiments were carried out using the iTC200 system 
(Microcal) at 25°C. Typically, 0.1 mM Adhiron protein was 
titrated with 0.01 mM of a target protein (yeast Sumo, human 
Sumol, human Sumo2) by 2.5 min injection duration to allow 
return of the titration peak to the baseline. After fitting the 
integrated exothermal peaks, Ko values were determined 
using the Origin program (OriginLab). 

CD studies 

All circular dichroism (CD) measurements were performed on 
a Chirascan CD spectrometer (Applied Photophysics) at room 
temperature. The spectra were recorded over a wavelength 
range of 260-180 nm using a cuvette of 1 mm pathlength at a 
scan speed of 60 nm/min. The concentration of each sample 
was ca. 0.2 mg/ml in PBS buffer (pH 7.4). Data were further 
processed for noise reduction, baseline subtraction and signal 
averaging. 

Structure determination 

Crystallization trials were set up using the method of Walter 
et al. (2005). Crystals for the full-length Adhiron92 scaffold 
appeared after 38 days in Well H5 of a Morpheus screen 
(Molecular Dimensions) (0.02 M glycine, 0.02 M sodium 
L-glutamate, 0.02 M DL-alanine, 0.02 M DL-lysine hydrochlor- 
ide, 0.02 M DL-serine, 0.1 M MOPS pH 7.50, 0.1 M HEPES 
sodium salt pH 7.50, 10.0% w/v polyethylene glycol 20 000, 
20.0% w/v polyethylene glycol 550 MME). The shorter 81 
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amino acid AdhironSl scaffold gave crystals after 52 days in 
Well D2 of the PACTpremier screen (Molecular Dimensions) 
(0.1 M Malic acid/MES/Tris Buffer System pH 5.0, 25.0% w/v 
Polyethylene Glycol 1500). 

For the full-length Adhiron92 rod-like crystals ~25 ixm 
across were cryo-cooled directly into liquid nitrogen without 
the addition of cryoprotectants. Diffraction data were collected 
at beamline 124 at Diamond Light Source from multiple posi- 
tions along a single crystal with the starting angle at each pos- 
ition offset by 30° with respect to the previous position. Data 
were collected to a resolution of 1.75 A using an X-ray wave- 
length of 0.9686 A with the beamsize set to be 20 x 20 jjim^ 
at the sample. For the shorter Adhiron-81 rod-like crystals 
~20 |xm across grew in clumps. Diffraction data were col- 
lected from a single 20 x 20 x 20 |jLm^ crystal fragment. 

All diffraction data were integrated using XDS (Kabsch, 
2010) and scaled using AIMLESS (Evans, 2006). Phases were 
obtained via molecular replacement using PHASER (McCoy 
et ai, 2007). A subsection of a complex of an Adhiron bound 
to a soluble protein (manuscript in preparation) was used as a 
search model. Refinement was carried out using REFMAC 
(Murshudov et ai, 2011), PHENIX (Afonine et ai, 2012) and 
COOT (Emsley et ai, 2010). Full scaling and refinement sta- 
tistics are presented in Tables I and U. 

Results and discussion 

Consensus phytocystatin 

A consensus phytocystatin sequence was derived by identify- 
ing the most common amino acid at each position of an align- 
ment of 57 phytocystatin sequences. The length of the initial 
consensus protein was set at 95 amino acids with the 
N-terminus positioned four residues before the conserved 
N-terminal glycine residue, and thus before the first |3-strand 
((31). The C-terminus was set 15 residues after the conserved 
PW motif and thus after the last (3-strand (|34). These criteria 
were based on the structures of chicken egg white cystatin 
(Bode et ai, 1988) and human stefin B (Stubbs et ai, 1990) 
from X-ray structures and of OSA-I from the nuclear magnetic 
resonance structure (Nagata etai, 2000). 



Table I. Scaling and refinement statistics for Adhiron92 



Average unit cell 35.86, 35.86, 61.85; 90, 90, 90 

Space group P4i 





Overall 


Inner shell 


Outer shell 


Low resolution limit 


35.86 


35.86 


1.78 


High resolution limit 


1.75 


9.09 


1.75 


Rmerge (all 1+ and I — ) 


0.082 


0.104 


0.897 


Rmeas (all 1+ and I — ) 


0.085 


0.108 


0.960 


Rpim (alll+ andl-) 


0.023 


0.029 


0.326 


Rmerge in top intensity bin 


0.066 






Total number of obs 


101 109 


737 


3060 


Total number unique 


7889 


62 


398 


Mean ((I)/sd(I)) 


18.9 


38.3 


2.6 


Completeness 


99.3 


99.5 


91.6 


Multiplicity 


12.8 


11.9 


7.7 


Average mosaicity 


0.14 






Estimated B factor 


27.6 






Rwork/Rfree 


0.1829/0.2287 




RMS bonds 


0.007 A 






RMS angles 


1.072° 







Residues in the favoured region 97.4% (75/77) 

of Ramachandran plot 



Table II. Scaling and refinement statistics for AdhironSl 



Average unit cell 


36.37, 36.37, 59.24; 90, 90, 90 


Spa.ce group 


P4, 








Overall 


Inner shell 


Outer shell 


Low resolution limit 


36.37 


36.37 


2.32 


High resolution limit 


2.25 


9.00 


2.25 


Rmerge (all 1+ and I — ) 


0.102 


0.053 


0.722 


Rmeas (all 1+ and I ~ ) 


0.119 


0.062 


0.839 


Rpim (all 1+ and I — ) 


0.060 


0.031 


0.423 


Rmerge in top intensity bin 


0.053 




- 


Total number of obs 


13 790 


209 


1295 


Total number uni(jue 


3709 


64 


339 


Mean((I)/sd(I)) 


8.2 


17.5 


1.8 


Completeness 


99.9 


99.5 


100.0 


Multiplicity 


3.7 


3.3 


3.8 


Average mosaicity 


0.21 






Estimated Bfactor 


33.7 






Rwork/Rfree 


0.1839/0.2477 




RMS bonds 


0.009 A 






RMS angles 


1.284° 






Residues in the favoured region 


93.5% (72/77) 





of Ramachandran plot 



Adhiron scaffold design and pliage display 

The Adhiron scaffold was derived from the phytocystatin con- 
sensus protein by initial N-terminal truncation to a 92 amino 
acid sequence which we refer to as the full-length Adhiron 
scaffold, or Adhiron92. We subsequently generated further 
N-terminal truncated versions of which the shortest is an 81 
amino acid sequence referred to as Adhiron81. The coding se- 
quence for the full-length Adhiron, codon optimised to 
enhance expression in E. coli (Fig. lA), was cloned into the 
phagemid vector pBSTGl (Fig. IB) to allow production of an 
Adhiron/truncated-pIII fusion protein in ER2738 suppressor 
cells for phage display. 

Expression of the Adhiron-pIII fusion protein was con- 
firmed by Western blot analysis with an anti-pIII antibody. 
The thermal stability of the Adhiron92 and Adhiron81 scaf- 
folds was tested by DSC and both gave identical profiles with 
a melting temperature of 101 °C (Fig. 2A). The secondary 
structure was examined by CD, and revealed a high ratio of |3 
sheet to alpha helix and random coil (Fig. 2B). Electrospray 
mass spectrometry of Adhirons expressed from pETU showed 
the predominant species to be lacking only the N-terminal 
Met, a common modification of cytoplasmically expressed 
proteins, with a minor component lacking MetAla. 

Adhiron structure 

We have determined the crystal structure of the full-length 
Adhiron92 scaffold by X-ray crystallography. The crystals 
belonged to space group of P4i and diffracted to a resolution 
of 1.75 A (Table I; PDB ID code 4N6T). The overall structure 
comprises the characteristic cystatin family fold of a four- 
strand anti-parallel |3-sheet core with a central helix (Fig. 3). 
Amino acids 1-10 and 90-92 are not visible in the electron 
density maps presumably as they are disordered. The shorter 
Adhiron81 also crystallised in space group P4i and the struc- 
ture determined to 2.25 A resolution (Table II; PDB ID code 
4N6U) was essentially identical to that of Adhiron92. The 
Adhiron structure is compact with limited unstructured loops 
and this is consistent with the very high melting temperature 
of this consensus protein. 
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Fig. 2. Characterisation of the Adhiron scaffold and hbrary. (A) DSC to 
determine the melting temperature of the Adhiron scaffold (T^ lOl'C). (B) 
Ninety-six random clones were isolated and sequenced both as the phagemid 
transformed and Adhiron phage library infected E. coli ER2738 cells. The 
percentage of each amino acid within the variable regions is shown. An ideal 
library would contain 5.26 + 2.3% of each amino acid; cysteine was not 
included in the library. (C) CD analysis of the Adhiron scaffold and of three 
Adhiron proteins containing inserts from the library. All show high (3 structure 
content. 



Library design 

The introduction of peptide encoding sequences, suitable for 
molecular recognition, was guided by the predicted loop posi- 
tions within the known structure of the rice phytocystatin 
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Fig. 3. X-ray crystal structure of Adhiron92 scaffold (PDB ID no. 4N6T) at 
1 .75 A resolution. The single alpha helix and the four anti-parallel (J strands 
are shown in white with the insertion sites for library production shown in 
black. Residues 1-10 and 90-92 are not visible in the structure and are 
presumably disordered. The structure of AdhironSl at 2.25 A resolution (PDB 
ID no. 4N6U) is essentially identical. 



OC-1 (PDB code lEQK). VR 1 was positioned between the 
first and second |3 strands with VR 2 between the third and 
fourth (3 strands (Fig. 3). Sequences encoding nine random 
amino acids (excluding cysteine) were introduced at Loops 1 
and 2 by replacing four and three amino acid codons, respect- 
ively, by using codon-selected semi-trinucleotide cassette syn- 
thesis. The phage-display library was estimated to comprise 
~1.3 X 10^ independent clones obtained with just 12.6 |xg of 
ligated DNA attributable to high ligation and electroporation 
efficiencies. To check the amino acid composition, 96 random 
clones were selected as colonies from the original transform- 
ation and revealed no bias in amino acid composition and ca. 
94% full-length clones. Following phage recovery and library 
re-infection of ER2738 cells, a further 96 clones were random- 
ly sequenced. The amino acid residue frequency (Fig. 2B) 
encoded at the phage level met the expected Poisson statistics 
of 5.26 + 2.3% for trimer synthesized oligos using a 19-aa 
mixture (Krumpe et al., 2007). Clones (86.5%) were full 
length with only 3.1% clones comprising the Adhiron scaffold 
with no inserts, and 10.4% clones showing frame shifts. 
Interestingly, all frame-shift mutants analysed occurred at the 
transition base between standard nucleotide and trimer coup- 
ling, suggesting that this step of DNA synthesis is of crucial 
importance for semi-trimer oligo synthesis. Fully trimer 
synthesised oligos might therefore be expected to further 
improve library quality. However, of the insertions and dele- 
tions, between 3 and 8% were unlikely to impact on Adhiron 
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function as the affected variants remain in frame. The high 
proportion of full-length coding regions at the level of phage- 
mid following phage packaging demonstrates the high quality 
of the library generated. 

Three random Adhiron clones containing inserts were used 
for protein purification and CD was performed (Fig. 2C). All 
three showed a high proportion of |3 structure, as found in par- 
ental phytocystatins (Irene D et al., 2012), with one protein 
displaying a higher content of |3 structure likely indicating ex- 
tension of the (3 strands from the scaffold into the insert 
regions. This demonstrates that the scaffold can tolerate inser- 
tion of peptides in these loop regions and these do not disrupt 
the secondary structure of the scaffold. 

Library screening 

For library evaluation, the ySUMO protein was used as a 
model target. Over 1000- fold amplification in colony recovery 
was observed compared with control samples by panning 
round three. Twenty-four Adhirons were isolated and their 
ability to bind to ySUMO was confirmed by phage ELISA 
(Fig. 4A) with little or no binding to the control wells. The 
clones were sequenced and 22 distinct Adhirons termed 
Adh-ySUMO 1-22 were identified with the amino acid 
sequences of the VRs shown in Table III. 

Ad-ySUMO 4 and 5 are identical and 16 and 22 are also 
identical. Interestingly Ad-ySUMO 15 and 23, as well as 21 



and 22, contain the same amino acid sequence in VR 1 but dif- 
ferent sequences in VR 2. Analysis of the sequences allowed 
identification of a commonly occurring SUMO Interacting 
Motif (SIM) (Kerscher, 2007; Li et al, 2010; Sun and Hunter, 
2012) sequence IDLT in Positions 1-4 of VR 1 in 12 of the 
Ad-ySUMOs, indicating that this is likely to be an important 
motif in binding to at least one epitope on ySUMO. This SIM 
motif was not found in VR 2 in any of the clones analysed. 
Interestingly, the IDLT motif is similar to the human SUMO 1 
binding site of the MEF2 E3 ligase PIASx (VDVIDLT) (Song 
et al, 2004; Song et ai, 2005). Either a P or G was identified 
at Position 1 of VR 2 in nine different Adhirons while a P or G 
occurs in a position between Residues 2-5 in another 6 
Adhirons potentially indicating that some structural feature 
may be important in binding. For example, clones ySUM021, 
22, 23 and 24 which have the IDLT motif in VR 1 have the 
motif PXi_3(N/Q)(W/F/Y) or G(L,I), which is not found in 
VR 1. We also identified a common pattern (W/F/Y)(E/D)2_ 
4(W/F/Y) represented in both VRs. This molecular pattern 
matches the criteria for SIMs; a high percentage of hydropho- 
bic amino acids juxtaposed with acidic residues (Song et al., 
2005), suggesting that both VRs may act independently, but 
contribute to the overall binding. 

Four Ad-ySUMOs were selected for further characterisa- 
tion, Clones 15 and 22 as the VR 1 sequence occurred multiple 
times. Clone 10 as it contained the IDLT motif and Clone 19 
as it contained a distinct motif in VR 1 . 
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Fig. 4. Isolation and purification of yeast SUMO-binding Adhirons. (A) Phage ELISA of Adhirons from 24 clones incubated in wells containing ySUMO (black) 
or control (grey) showing the TMB product absorbance at 560 nm after 3 min. incubation. (B) Ad-ySUMOs were expressed in BL21 (DE3) cells and cell lysates 
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and 22 together with the Adhiron scaffold. 
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Table III. Adhiron insert sequences for 24 yeast SUMO-binding Adhirons 
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Characterisation of the ySUMO Adhirons (Ad-ySUMO) 

Due to the high thermal stabihty of the Adhiron scaffold 
(Fig. 2A), we predicted that purification may be aided by intro- 
ducing a heating step to denature and precipitate the majority 
of E. coli proteins without affecting Adhiron integrity. We 
therefore heated lysates for 20 min at 20, 50, 60, 70, 80, 90 
and 100°C, centrifuged to pellet the denatured protein and 
analysed the supernatants by SDS-PAGE (Fig. 4B). The 
heating step dramatically decreased the quantity of bacterial 
protein but did not significantly reduce Adhiron levels. A 
temperature of 50°C was suitable to remove the majority of 
bacterial proteins and so was adopted. Figure 4C demonstrates 
that the purified Ad-ySUMOs show high purity using a 
batch metal affinity purification method. The estimated level 
of Adhiron expressed was ~100mg/l. DSC confirmed the 
selected Adhirons maintained heat stability, showing of 
95.2, 87.7, 87 and 86.7°C for Ad-ySUMO 10, 15, 19 and 22, 
respectively (Fig. 4D). This demonstrates the ability of the 
scaffold to effectively constrain the VRs while maintaining 
high thermal stability of different Adhiron variants. This is 
consistent with previous observations reported for DARPins 
that the introduction of diversity results in some loss of stabil- 
ity (Binz et al, 2003; Kohl et al., 2003; Wetzel et ai, 2008). 
However, unlike DARPins we are introducing 18 variable 
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Fig. 5. Characterisation of yeast SUMO-binding Adhirons, Ad-ySUMO 10, 15, 20 and 22. (A) Biotinylated Ad-ySUMOs were used to detect ySUMO (black). 
Human SUMO 1 (light grey) and human SUMO 2 (dark grey) by ELISA with TMB product detected at 560 nm. (B) Western blots using biotinylated Ad-ySUMO 
clones against yeast SUMO alone (upper panel) and mixed with 20 |j,g of HEK293 cell lysate (lower panel). (C) Western blot analysis using biotinylated 
Ad-ySUMO clones against yeast and human SUMOs 1 and 2. (D) Isothermal calorimetry of Ad-ySUMOs binding to yeast SUMO with the isotherms and the data 
fits. 
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amino acid residues by replacing two short loops of a non- 
repeat protein scaffold and so the retention of such high ther- 
mostability is notable. Fibronectins and the leucine-rich 
repeat-based Repebodies have reported melting temperatures 
of around 90°C (Jacobs et ai, 2012) and 85°C (Lee et ai, 
2012), respectively. 

To evaluate the use of the Adhirons as research reagents, 
the Ad-ySUMOs were biotinylated and used in ELISA 
(Fig. 5A) and western blot analysis (Fig. 5B and C). The 
Ad-ySUMOs show selectivity by binding to yeast SUMO but 
not to human SUMO 1 or 2 (n = 3). To determine the specifi- 
city of the reagents, yeast SUMO was mixed with HEK293 
cell lysates. Interestingly, Ad-ySUMO 10 and 15 show specific 
binding to yeast SUMO with no binding to other proteins 
whereas Ad-ySUMO 19 and 22 bind to other proteins in the 
lysates (n = 3), indicating that the inserted sequences are more 
promiscuous, emphasising the importance of strict negative 
screens during panning. The affinities of the Adhirons for 
ySUMO were measured by isothermal titration calorimetry 
(ITC). Ad-ySUMO 10, 15, 20 and 22 bound with Kq values of 
29.6 ± 0.8, 33.3 + 1.8, 45.8 + 8.8 and 47+ 15.6 nM, re- 
spectively (Fig. 5D). Similar affinities were observed using the 
Blitz™ analysis (ForteBio) when the Adhirons were immobi- 
lised on a biosensor surface which validates the use of the 
Blitz™ for rapid estimation of binding affinity ranges. These 
Kd values are in line with those observed for antibodies. By 
contrast, the /To values for human SUMO 1 and 2 were 
>10^-fold higher at 15 and 50 jjlM, respectively (data not 
shown). The binding affinities of human SUMO 1 and 2 to a 
peptide bearing the conserved IDLT SIM motif have been 
reported to be in the range of 5-6 |xM (Song et ai, 2005; Li 
et ai, 2010), which is in agreement with our data. The higher 
affinity for yeast SUMO than the human SUMOs indicates that 
sequences other than the IDLT in VR 1 and/or 2 play a dis- 
criminating role in ySUMO binding. The affinities obtained 
for the yeast SUMO binders are in the same range as those 
against other targets reported for DARpins (Schweizer et ai, 
2007; Steiner et ai, 2008) and monobodies (Gilbreth et ai, 
2011; Grebien et ai, 2011), suggesting that the Adhiron scaf- 
fold is competitive with other non-antibody binding proteins. 

Further example screens 

To further evaluate the functionality of the Adhiron library, we 
identified binders to other targets, including the growth factor 
(FGFl), receptor (CD31), SH2 domain Grb2 and a peptide se- 
quence. All screens were performed over three panning 
rounds. Phage ELISA was used to examine the ability of 
recovered Adhirons to bind to the corresponding target 
(Fig. 6). While the majority of the FGFl, CD31 and Grb2 
Adhirons showed specific binding, this was not the case for 
the peptide screen. In this case, specificity was enhanced by 
increasing the number of panning rounds. This is not unex- 
pected due to the small size and limited likelihood of appropri- 
ate epitope presentation by the peptide compared with larger 
proteins likely to allow better presentation of single or mul- 
tiple epitopes. To further confirm that expressed Adhirons 
bind to their targets, we have used the Blitz™ to analyse three 
distinct recombinant Adhirons for both CD3 1 and the peptide 
target. The Adhirons were expressed and purified as soluble 
proteins. The K^^ values for CD31 Adhirons ranged from 
8.5 X 10"^ to 6.8 X 10"'' M, while those for the peptide 
ranged from 3.3 x 10"* to 3.5 x 10"*. These data further 



1.5n 



0) 

o 1.0- 



growth factor 



control 



M ^ ^ ^ 



1 



clone ID 

^ 1.5n ^1 Receptor |H control 



S 1.(H 



S 
o 



0.5- 



0.0- 



^.5^^ 



clone ID 
^1 Peptide WM control 



1.0- 



0) 

u 
c 
re 



O 

(A 

^ 0.H 



0.0- 



N <s <<5 A 



1) ^^<^<!>^^^^<^,^ 



clone ID 



Fig. 6. Phage ELISA results for Adhirons identified in screens against (A) 
growth factor protein FGFl, (B) cell surface receptor CD31 and (C) a peptide. 
Graphs show absorbance readings of each well after the addition of TMB. 
Wells containing the target molecule are shown in black and control wells are 
shown in grey. 



demonstrate that the phage ELISA identifies high affinity 
Adhirons. 

Analysis of our screen against Grb2 offered the opportunity 
to directly compare the efficacy of our Adhiron library against 
well-established ScFv and Fab libraries. A recent study 
reported high throughput screens of 20 SH2 domains using 
both hybridoma and phage-display libraries of ScFv's 
(Colwill et ai, 2011). This allows comparison of the results 
for Grb2 from this study with our screen against Grb2. The hit 
rates for two ScFv and a Fab library were 4, 43 and 33% (410 
clones screened in total) with 5, 2 and 6 unique clones identi- 
fied from each screen, respectively. In our screen, the hit rate 
was 92% with over 30 unique binders, although not all were 
sequenced. Our Adhiron purification success rate was >95% 



153 



C.Tiede et al. 



with typical yields of 50-100 mg/1 compared with the 
reported purification success rate of 80% with yields of 
0.6- 10 mg/1. This is a limited comparative study but demon- 
strates the complexity of the Adhiron library and the potential 
for generating specific protein binding reagents. 

Conclusions 

We have developed a new artificial binding protein scaffold 
termed Adhiron based on a designed consensus phytocystatin 
protein. The properties of this scaffold also match the criteria 
proposed for artificial therapeutic proteins (Carter, 2011) al- 
though it remains to be determined whether Adhirons will be 
suitable for such applications. The design of the scaffold and 
library provides a system based on a highly stable scaffold 
with extended flexible binding regions that are expected to 
adapt to form appropriate molecular contacts with a wide 
range of targets allowing interactions with protein pockets, 
protein surfaces, peptides and small molecules. The system 
achieves high-level purification of soluble Adhiron (typically 
10-100 mg/1) from E. coli, by including a heat enrichment 
step enabling ease of engineering and manufacture. The scaf- 
fold displays high thermostability with a melting temperature 
of lore determined by DSC (Fig. 2A). 

We have solved the X-ray crystal structure of the Adhiron 
scaffold reported here, as well as selected Adhirons in 
complex with target proteins which will be reported else- 
where. The ability to gain structural information is important 
for improving our understanding of the molecular interactions 
of Adhirons with their targets that lead to functional conse- 
quences, and to provide a basis for drug design. We are cur- 
rently exploring the extent to which the Adhiron scaffold may 
offer a platform for selection of reagents for applications in- 
cluding research tools, diagnostic, imaging, therapeutic agents 
and for drug discovery. 
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Supplementary material is available at PEDS online. 
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